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Abstract. In the framework of the semiclassical approach the universal spectral 
correlations in the Hamiltonian systems with classical chaotic dynamics can be 
attributed to the systematic correlations between actions of periodic orbits which 
(up to the switch in the momentum direction) pass through approximately the 
same points of the phase space. By considering symbolic dynamics of the system 
one can introduce a natural ultrametric distance between periodic orbits and or- 
ganize them into clusters. Each cluster consists of orbits approaching closely each 
other in the phase space. We study the distribution of cluster sizes for the backer's 
map in the asymptotic limit of long trajectories. This problem is equivalent to 
the one of counting degeneracies in the length spectrum of the de Bruijn graphs. 
Based on this fact, we derive the probability Vk that k randomly chosen periodic 
orbits belong to the same cluster. Furthermore, we find asymptotic behaviour 
of the largest cluster size jCmaxI and derive the probability P{t) that a random 
periodic orbit belongs to a cluster of the size smaller than t|C,nax|, t e [0, 1]. 



1. Introduction 

In their seminal paper [T] Bohigas Giannoni and Schmit conjectured that the local 
energy spectrum statistics of quantum systems with fully chaotic classical dynam- 
ics are universal and can be described by standard ensembles of Random Matrix 
Theory (RMT). To explore the origin of such universality semiclassical techniques 
based on the applications of the Gutzwiller trace formula was introduced by Berry 
in [2j . By this approach the correlations between energy levels of a quantum Hamil- 
tonian system can be related to the correlations between periodic orbit actions in 
the corresponding classical system. Within the diagonal approximation, where only 
correlations between periodic orbits themselves are taken into account, Berry man- 
aged to obtain the leading order of the universal spectral form factor. The diagonal 
approximation, however, turned out to be insufficient to reproduce full RMT result, 
whose derivation had remained a distinguished challenge for yet a long time [2111] • 
The breakthrough was achieved in 2001 when Sieber and Richter discovered a non- 
trivial mechanism of correlations between periodic orbit actions [S]. They showed 
that the next to the leading order term of the universal spectral form factor can be 
obtained by taking into account correlations between long periodic orbits with one 
self-crossing under a small angle (usually referred as encounter) and the ass! oci- 
ated partner orbits, see fig. [1^. Such pairs of orbits have close actions and contribute 
systematically into the spectral correlations. Later, this approach was extended to 
include correlations between periodic orbits having an arbitrary number of encoun- 
ters which culminated in the derivation of the full RMT result [B]. 
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In a nutshell Sieber-Richter pairs and its many-encounter analogs are nothing 
more then bunches of periodic orbits running through almost the same points of 
the phase space up to the switch in the momentum direction. Furthermore, in the 
case of broken time reversal symmetry only orbits passing close to each other with 
the same momentum direction are of relevance, see fig. [T)d. In the present paper we 
restrict consideration only to the latter case. Due to the hyperbolic nature of the 
dynamics the action difference between such trajectories is small and determined 
by the lengths of the encounters. Equally important, the correlation mechanism of 
Sieber-Richter pairs is robust to perturbations of the dynamical system. In fact, any 
hyperbolic system contains a large number of long periodic orbits with close actions 
which are not of Sieber-Richter type. However, it might be expected that, in general, 
the differences between their actions fluctuate enormously under perturbations of the 
system. Therefore, the contribution from a generic pair of periodic orbits is washed 
out after averaging (e.g., over ensemble of systems) and only pairs of Sieber-Richter 
type contribute systematically to the spectral correlations. 




Figure 1. a) Schematic picture of Sieber-Richter pair of periodic orbits 
with one encounter region being marked out by a box. b) Pair of periodic 
orbits with two encounters. As opposed to the case (a), here two orbits 
pass close to each other in the phase space of the system. Only such orbits 
contribute to the spectral correlations of systems with broken time-reversal 
symmetry [B]. 

The robustness of Sieber-Richter correlation mechanism can be easily understood 
by considering symbolic dynamics of the system. Assuming that a finite Markov 
partition exists, any periodic orbit can be encoded by a finite sequence of symbols 
X = XiX2---Xn from some alphabet |7]. The fact that two periodic orbits come 
close to each other in the phase space has a natural interpretation on the level 
of the corresponding symbol sequences. Namely, let x and y be two sequences 
of length n such that any subsequence of length p occurs in x exactly the same 
number of times as in y. This property of sequences x and y is referred below as 
p-closeness. It is straightforward to see that any pair of p-close sequences defines in 
fact a pair of periodic orbits which are close with respect to the Euclidean metric 
of the phase space. Their metric closeness is controlled by p: the larger parameter 
p is, the closer two periodic orbits approach each other. Accordingly, all periodic 
orbits of the system can be organized into a disjoint union of clusters of p-close 
orbits. Each cluster is composed of periodic orbits with close actions running through 
approximately the same points of the phase space in a different time order. 
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Motivated by the application to quantum chaos, questions regarding the number 
of periodic trajectories with close actions/lengths were previously addressed both 
in physics and mathematics literature in the context of several different models of 
chaotic dynamical systems: geodesies flows on manifolds of negative curvature [8], 
billiards IHHTO], quantum maps [TT] and quantum graphs [T21 - IT5] . In the present 
paper we study the phenomenon of clustering of periodic orbits on the level of 
symbolic dynamics. Our consideration is restricted to the simplest possible grammar 
assuming a two-letter alphabet, Xi G {0, 1}, and absence of pruning, i.e. each symbol 
in the sequence can be followed by any other symbol. These grammar rules is met, 
for instance, in the baker's map [TB]. The main question to be addressed below 
can be informally stated as follows: given an integer p, what is the probability that 
a randomly picked up periodic sequence (equiv. orbit) of length n has a certain 
number of p-close partners when n is large enough? 



2.1. The baker's map. In what follows we consider clustering of periodic orbits 
within the paradigm model of chaotic system - the baker's map T. Explicitly, 
the action of T on the points f = (q, p) of the two-dimensional phase space V = 
[0, 1) X [0, 1) is given by: 



where q and p play the role of the coordinate and momentum respectively, see 
e.g., [IS] for details. The baker's map has an advantage of having a particularly 
simple symbolic dynamics. This allows to avoid cumbersome notation and makes 
the exposition more transparent. 

2.2. Symbolic dynamics and periodic orbits. Symbolic dynamics is a standard 
tool widely used in the theory of hyperbolic dynamical systems. By this approach 
each point of the system phase space is identified with a sequence of symbols from 
a certain alphabet. Given such a representation the time evolution of the system 
takes a simple form. 

To introduce symbolic dynamics it is necessary first to define Markov partition 
of the phase space V. A standard choice for the baker's map is Vq U ^i, where 
Vo = [0, 1) X [0, 1) and Vi = 1) x [0, 1). Then any point v = {c\,p) e V can be 
uniquely encoded by a two-sided sequence of zeros and ones: x+ = xiX2 ■ ■ ■ , 

X- = . . .X-2X-1X0, Xi G {0, 1} using a simple algorithm: Xi = if T*f G Vq and 

= 1 if G Vi for i E Z. Positive i correspond to the "future" evolution of v, 
which is written in the x+ subsequence. The coordinate q expresses through this 
subsequence by the formula q = 0.x+. The "past" history of v and the momentum 
p are defined by x_ subsequence: p = 0.x_. 

Within the above symbolic representation the time evolution of the system is 
given by the shift map 



2. Definitions and main results 



(2.1) 




cr : [. . . x_ia;o.Xia;2a;3 ...]-)■[... x_ 1X0X1. 0:2X3 . . . ], 
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which moves the separation point "." between the future and the past in the sequence 
of symbols step by step. 

All infinite periodic sequences composed of one and the same finite piece x G Xn 
correspond to periodic orbits of the system. Here and below symbol X„ stands for the 
set of all possible sequences of zeroes and ones having the length n. Let 'jx denotes 
the periodic orbit of the backer's map associated with the sequence x G X„. Note, 
that two sequences x, x' correspond to the same periodic trajectory if and only if 
they are related by the cyclic shift, i.e. if x = X1X2 ■ ■ - Xn and x' = Xj+i . . . x„Xi . . . Xj 
for some i G {1, . . . n — 1} then correspond to one and the same periodic trajectory 
7x = Ix'- In what follows we will also consider the quotient set Xn '■= X„/ ~ with 
respect to the cyclic shift x ~ x'. It is convenient to think of the elements belonging 
to the set Xn as of sequences from Xn with the "glued" ends. Importantly, the 
elements from Xn and all periodic orbits having the period n are in one to one 
correspondence according to the remark made above in this paragraph. 

2.3. Clusters of periodic orbits. To define the clusters of periodic orbit, firsts, 
we need to introduce the notion of their closeness. Take two ra-periodic orbits jx, 
jy composed of n points {7x(i)}"=o^, {7y{'i)}^=o in the phase space. It is natural 
to think about two orbits jx, 1y as of close ones if in a vicinity of any point 7x(0) 
i = 0, . . .n — 1 one can find some point from the set {'yy{i)}'i=o and vice versa. 
In other words, two trajectories pass through approximately the same parts of the 
phase space but perhaps in a different order. To put this picture on a more solid 
background, we say ^x is in the p-neighborhood of 'jy if there exist exactly n pairs 
{lx{ik),'yy{i'k)), k = 1, . . .n such that for each k distance between points is bounded 
by \\lx{ik),1y{i'k)\\ ^ 2~P, where the distance in the phase space between 

V = (q, p) and v' = (q', p') is defined as := max{|q — q'|, |p — p'|}. 

The above notion of metric closeness between periodic trajectories can be carried 
over to the topological space Xn- We will say that two sequences x,y G Xn are 
p-close if any sequence of p < n consecutive symbols 0102 . . .ap, ai G {0, 1} appears 
the same number of times (which might be also zero) both in x and y. Speaking 
informally x and y are p-close if their local content (of the length p) is exactly the 
same in both sequences. This equivalence relation is denoted below as x ^ y. It is 
straightforward to see that 'jx is in the p-neighborhood of "jy whenever x ^ y. 

There are two simple but important properties of the equivalence relation x ~ ?/ 
which should be emphasized: 

m P P P 

I The relations x ~ ?/ and x ~ z also imply that z ^ y; 
II The relation x ^'^^ y implies that x ~ ?/. 

According to the first property all periodic sequences (resp. periodic orbits) can be 
separated into a number of clusters Cf^\ i = 1, . . . A/^ such that two sequences x 
and y (resp. "jx, 'jy) belong to the same cluster if and only if x ~ y. For instance, 
for given p = 2 three sequences [1101000], [1100010], [1100100] belong to the same 
cluster, see fig. [2J In a completely analogous way one can consider clusters C^^\ 
i = 1, . . .Afp of sequences from the set Xn- The connection between clusters is given 
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p=4 



Figure 2. Example of hierarchy of periodic sequences for n = 7. The 
ultrametric distance between two periodic sequences (shown in red boxes) 
is determined by the minimal cluster (blue boxes) they both belong to. 
E.g., d{x,y) = 4, d{x,z) = d{z,y) = 5 for x = [1101000], y = [1100010] 
and z = [1100100]. 



by: = / ~. In other words, each x G corresponds to a set of sequences 
from C,-^'' which are related to each other by a cyclic shift. 

The second property oi x y allows to organize the clusters of periodic sequences 
in a tree like structure. The p-th level of the tree contains clusters of p-close periodic 
sequences, see figs. [2|3l One can introduce a distance in the space of sequences based 
on this hierarchical structure. The distance d{x, y) between two elements x and y 
being proportional to the maximal level of the tree where x and y belong to the 
same cluster: d{x,y) = n — max{p\x ~ y} {d{x,x) = 0) satisfies the ultrametric 
property d{x, y) < max{(i(a;, z), d{z, y)} [18J. By the identification of periodic orbits 
1x, 1y with the corresponding sequences x,y, one can lift the distance d{x,y) to 
the space of periodic orbits. And we come to the conclusion that the space Xn (or 
equivalently the space of periodic orbits) acquires a natural ultrametric structure. 



2.4. Cluster distribution. The primary goal of the present paper is to understand 
how the distribution of the cluster sizes depends on the level p in the limit 

n — )• oo. To this end, we need to estimate the moments of the cluster sizes: 

(2.2) z, = f;icfi^ 

i=l 
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p=0 



P=l 

p=2 



mm • P=n 



Figure 3. Scheme of hierarchical structure of periodic sequences. The 
p-th level of the tree is composed of clusters (depicted by circles) of p-close 
trajectories. Each cluster at level p is split into a number of clusters at the 
next level p + 1. 



It turns out, however, that a more convenient object to consider is 
(2.3) Z, = 5^|Cf^)i^ 



i=l 

where the sum runs over the clusters of sequences from X„ (rather than sequences 
from Xn). The connection between cluster sizes in X„ and is particularly simple 
if n is a prime number. In this case for each x E Xn except x^^^ = [00 ... 0] and 
= [11 ... 1]^ there are exactly n sequences from X„ which are related by the 
cyclic shift. Two special clusters corresponding to sequences x^^\ x^^^ contain just 
one element. For all other clusters: 

(2.4) |Cf^)| =n|Cf^)|. 
This yields 

(2.5) = + 2, rf„ = + 2, 

where stands for the total number of elements in X^- In the case when n is not 
a prime number, the connection between cluster sizes in X„ and is not anymore 
trivial due to the presence of periodic orbits with a period less than n. However, 
if one includes only prime periodic orbits (whose period is exactly n) into clusters 
C'f \ C'f'^ then the connection ([23D remains valid. As a matter of fact, the exclusion 
of non-prime periodic orbits can be justified on the ground of a standard argument 
that their number is negligible in comparison with the number of prime periodic 
orbits in the limit n — t- oojj We arrive to the following relationship between (12. 2p 



""^Thc number of prime and non-prime periodic orbits scales as 2" and 2"/'', respectively, where 
r > 2 is a minimal divisor of n 
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and fl^ : 



(2.6) Zfc = -f (1 + 0{n-')) dn = -{l + 0{n-^)) . 

Note also that for a typical cluster \C\^''\ (1 + 0{n~^)) = n\cl^''\. 

It worth of mentioning that the rescaled moments have a simple interpretation 
as probabilities of finding a number of periodic orbits in the same cluster. Indeed, 
let 7j, i = 1, ... be a set of A; > 2 orbits randomly chosen from the total set 
r„ = {jxlx € of periodic orbits of length n. Then the probability that all k 
orbits belong to the same cluster is given by 

In particular, the probability V2 that two periodic orbits belong to the same cluster 
is given by /<^n- 

2.5. Main Results. The central result of the present paper is the following asymp- 
totic formula for Z^. in the limit — )■ 00: 

(2.8) ^^ = 2-(-) (-) (l + 0(„-)). 

Using then eq. (12. 6p and eq. (12. 7p we obtain the probability of finding k random 
orbits in the same cluster 

(2.9) P.= y (-) (l + 0(„-)). 

In addition, we show that the number of periodic orbits in the largest cluster Cmlx 
is asymptotically given by: 

(2.10) |C«|.(!:)(|1)^'"'(1.0(.-)). 

Based on eqs. (I2.8f2.10p we deduce probability P(t), t G [0, 1] that random periodic 
orbit from the set r„ belongs to a cluster with the size less then t|Cmax| and show 
that in the limit of n — t- 00 this probability depends only on p: 



(2.11) P{t) = [ p{T)dT, p{t) 

Jo 



(logr)' 



(2P-2 - 1)! 



As we show in the body of the paper the problem of counting cluster distribution 
of p-close periodic orbits is in fact equivalent to the one of counting degeneracies 
in the length spectrum of the so-called de Bruijn graphs JT7\. In this context the 
asymptotic behavior of Z2 and related questions have been considered previously 
in ^12-ril5j . The exact connection between our results and the above mentioned 
works is discussed in the last section of the paper. 
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2.6. Organization of the paper. The paper is organized as follows. In Sec. 3 
we show that the problem of counting cluster sizes of p-close periodic orbits in the 
backer's map can be cast in the form of counting closed paths on a certain graph 
passing through the same edges (or vertices). Using this connection we express 
as a matrix integral of certain type. In Sec. 4 we evaluate these integrals in the 
saddle point approximation and derive eq. (12. 8p . In Sec. 5 we obtain asymptotic 
formula for the size of the largest cluster Cmlx- Using then the results from Sec. 4 
we arrive to the probability P{t) of finding a periodic orbit in a cluster of the size 
smaller than t|Cmax|. Sec. 6 is devoted to the discussion of uniformity of periodic 
orbits distribution over the graph. Finally, the concluding remarks are presented in 
Sec. 7. 

3. Clusters of closed paths on graphs 

As we show below, the counting problem of p-close periodic orbits is equivalent 
to the one of counting closed paths on the de Bruijn graph Gp passing the same 
number of times through its edges. The graph Gp is constructed in the following 
way. With each sequences a = [0102 . . . Op], a G Xp we associate a directed edge Ca 
of Gp whose initial and terminal points are denoted by e^'"' and 6^°"'', respectively. 
The connections between 2^ edges are fixed by the rule: for any pair of edges e^, e^, 
defined by the sequences a = [0102 . . . Op], 6 = [6162 ■ ■ - bp] the endpoints e^'"', e|,°"'' 
belong to the same vertex if and only if Oj = fej+i for alH = 1, . . .p — 1, see fig. |H 




It is straightforward to see that any closed path g on the graph Gp passing through 
n edges can be uniquely represented by a sequence x = [xiX2 ■ ■ ■ Xn] from the set 
Xn- By such identification i'th edge of Gp passed by g corresponds to the segment 
[xjXj+i . . . Xp_i+j] of the sequence x. We will use notation g^ to denote closed paths 
corresponding to x G X„. For each closed path g^ let n{x) = {ua, a G Xp} be the set 
of integers, such that Ua is the number of times g^ passes through the edge a. Then 
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X ^ y if and only if g^, Qy go through every edge of Gp the same number of times 
(but in different time order) i.e., n{x) = n{y). Therefore, each cluster Cn of p-close 
periodic orbits is uniquely determined by the vector of integers n = {ua, a G Xp}. 

Remarks 3.1. a) By attaching a length to each edge of the graph Gp one can turn 
it to a metric graph. Note that each cluster Gn consists then of trajectories having 
the same length. Accordingly, counting of cluster sizes is equivalent to counting of 
degeneracies in the length spectrum of the corresponding metric graph. The last 
problem have been studied in flB^W^ for some classes of metric graphs, see also 
discussion in Sec. 7. 

b) Counting of closed paths passing the same number of times through the edges 
of Gp, is actually equivalent to counting of closed paths passing the same number of 
times through the vertices of twice larger graph Gp+i. Indeed, let us enumerate the 
2^ vertices of the graph Gp+i in the same way as edges of Gp, see fig. ^ By the 
identification of each closed path of the length n with a sequence x G X„ we obtain 
one-to-one correspondence between closed paths on two graphs. Correspondingly, 
clusters of closed paths have the same sizes. 

To find the size |C„| of n'th cluster we need to count the number of closed 
paths which go through the edges a G Xp of Gp exactly times. To this end 
we introduce connectivity matrix Q between edges of the graph. It is convenient 
to use for this purpose a tensorial representation for the vector space on which Q 
acts, see [HIED]. Let Ti be the 2^-dimensional linear space spanned by the vectors 
\a) = \ai) ® \a2) ■ ■ ■ ® |ap), cii G {0, 1}. The linear operator Q acts on the vectors 
from T-L according to the rule: 



(3.1) 



Q\ai) ® \a2) ■ ■ ■ ® \ap) = \a2) ® 



Note that this definition agrees with the connectivity rules between edges on the 



graph Gp, where each edge Cq 
[a2 . . . ttpO] and Ca" , 



[aia2---ap] is connected with the edges e^', 
[a2 . . . ttpl]. In addition, with each edge we associate 



a phase 0^ and define the diagonal operator A(0), := {0a|a G Xp}: 

A(0)|ai) ® laa) ■ ■ ■ ® \ap) = e'^''\ai) ® jaa) ■ • • ® \ap). 
It is straightforward to see that in the matrix form Q, A(0) can be written as [19 

(3.2) 



Q 



n 


.. 


. 


1 


.. 


• °^ 


1 


.. 


. 


1 


.. 


. 





1 . . 


. 





1 . . 


. 





1 . . 


. 





1 . . 


. 




\0 



1 
1 



1 

1/ 



A(0) 



e'^'- 


V 












\ 
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The introduction of matrices Q, A(0) is useful because of the following relation- 
ship between traces of their products and the sizes of the clusters Cn- 

(3.3) TT{QK{ct>)Y = J2 \Cr.\ exp (^(n, 0)), (n, 0) = 

n aGXp a£Xp 



with the first sum running over all clusters Cn- Eq. fl3.3p is a key component of our 
analysis, as it allows to express \Cn\ through the traces of powers of matrix QA{(/)). 
In particular, the second moment Z2 can be represented in the form of the integral 
over (ha. 



n 



2lT 







271 



^\Tt{QA{<P)) 



n\2 



(3.4) ^2 = J] \Cn? = 
Analogously, higher order moments are given by 

(3.5) ^.=Ei^^i'^=nn r^Tr(gA(0O))r5fx:0i' 

n j=l aeXp \ 1=1 



(0 



As we show below the number of integration and dimensions of matrices in 
eqs. (13. 4 [ 13. 5p can be actually reduced by the factor of two. Note that the 2^ x 2^ 
matrix Q can be represented as the product Q = RS of the matrices 



(3.6) R 



/I 





.. 






















1 





.. 


. 




n 








.. 


. 1 











•A 





1 


.. 


. 







1 





.. 


. 


1 













1 


.. 


. 


s = 








1 


.. 


. 





1 













. 













1 .. 


. 








1 










.. 


. 1 




V 
















•7 


\o 





.. 


• V 





















whose dimensions are 2^ ^ x 2^ and 2^ x 2^ ^, respectively. Changing the order of 
the matrix product in eq. (13.31) yields then 

(3.7) IV(gA(0))" = Tr(5A(0)i?)" = Tr(Q'(0))", 

where Q'{(f>) = S'A(0)i? is 2^"^ x 2^"^ matrix of the form 



(3.8) Q'(0) 





. 





gi</'2P-i+i 









. 





g*<^2P-l+2 



























g«04 








g«02P-l+4 






















. 


gj</>2P-l_i 








g«02P 


V 


. 










g«02Py 



Furthermore, using the invariance of the trace under the transformation 

Q'{cf>)^A{6)Q'{(f>)A~\d), 
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where A((5) is an arbitrary diagonal matrix, we can exclude half of the integration 
variables in eqs. (13. 4[ 13. 5p . see Appendix A. The result is the following expression 
for the moments 



(3.9) 



nn/ ^T.(Q(^»))"W5: 

j=l i=l >^0 \ '-1 



with the matrix Q{(f)) given by 



(3.10) 



Q(0) 








«03 




\ 

















1=1 



1 

1 

1 

1 



gi<^2P-i-i 

gi02P_l Q Q 









1 

1/ 



In the next section we will use eq. (13. 9 p to obtain large n asymptotic of 



4. Clustering Probability of k orbits 

In order to evaluate moments Zk we will apply saddle point approximation to 
the integral (13. 9p . where n will play the role of a large parameter. To make the 
exposition more transparent we first consider below the second moment and later 
extend the result to all > 2. 

4.1. Saddle point approximation for Z2. For k = 2 the integral (13. 9 p can be 

written as 



(4.1) Z, = (ll fj'^ -exp J-„(0), J-„(0) = \og\Ti{Q{4>)r\\ 

It is easy to see that the global maximum of J^n{,4>) is attained at = 0, where 
0i = 0, i = 1, . . . 2^"^ and J-'n(O) = nlog2. We therefore need to expand J^n{4>) 
around zero up to the second order in (p and then use saddle point approximation 
in (14. ip . To evaluate derivatives of J-'n{<P) it is convenient to use the following 
decomposition of the matrix Q{4>): 

Q(0) = A(0)go + Qi. 

Here Qq = [R 0], Qi = [0 R] are matrices composed of two blocks. The first (resp. 
second) block is given by the matrix R (from eq. (13. 6p ) of the dimension 2^~^ x 2^~^, 
while the second (resp. first) one is 2^~^ x 2^"^ matrix of zeroes (for the element- wise 
definition of Qo,Qi, see Appendix B). Straightforward calculations give then: 

^Tr(A((^)P,go(Q(0))-^) - 
Tr(Q(0))" J ' 



(4.2) 



2nRe 
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where Pj denotes projection matrix on j element of the basis, i.e., {Pj)m,i = Sm,A,i- 
It follows immediately that ^^^|</,=o = 0, as it should be for a saddle point. Taking 
an additional derivative in (14. 2 p yields 



(4.3) 



2n 



cp=0 



Efc=o Tr {R.QoQ'PjQoQ''-'-^) + 5,,,Tr {P^Q^Q^-^) 

TrQ" 

Tr(P,QoQ"-')Tr(P,QoQ"-''^ 



(TrQ 



n\2 



implying that 
(4.4) 



5'-^n(0) 



d(t)jd(t)i 

where the matrix B is defined by: 



-nBi 



cp=0 



(4.5) B = 2-^-1 (^Q + Q^ + 2I-{1 + 2p) (^^J^ , Q = Qo ^ (|) ^ • 
We can use now (14. ip to evaluate Z2 in the large n limit: 



Z2 = 2'-{1\ [ 



27r 



■ exp 



Bij(f)i(j)j 



(4.6) 



2'"(27rn)-''^ (detP)^ 1 + 



n 



1 + 



n 



The determinant of B can be explicitly calculated (see Appendix B) which finally 
leads to: 



(4.7) 



Z.in) = 2 



2n 



2P 



-1\ 2P 



1 + 



4.2. Saddle point approximation for Z^, k > 2. Our starting point is the rep- 
resentation (13. 9 p for Zk- 



(4. 



nn/ %i-exp^„({0(^)}). 



27r 



= log 

where we introduced notation: 



fc-i 



fc-1 

0:=5^0(^). 
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As in the case k = 2, the maximum of J-'n*^'' is attained when all phases vanish i.e., 
for = 0, j = 1, . . . A; — 1. After taking the first derivative of J-l'^^ we have 



(4.9) 



n 



^Tr (A(0«)p.go(Q((/,(O))n-i) 



Tr(Q(</>{0))^ 



n 



Tr(Q(-0))" 



The second derivative of J-l^'' at 0*^-'^ = 0, j = 1, . . . A; — 1 is then: 
(4.10) «^^-"('^' 



4>=o 



with the elements of the 2^'^^ x 2*''^'^ matrix B'^^^ given by: 

Note that B^^'^ can also be written as the product of two matrices composed of 
(A; - 1) X (A; - 1) blocks: 

(21 1 

(4.11) 5^ = - 

Vl 1 ... 21/ 

with 1 and standing for unit and zero 2^ x 2^ matrices, respectively. 
Applying now saddle point approximation to (14. 8 p we obtain 

(4.12) 



1 21 



1\ /5 

1 







5.. 
VO ... Bj 



■)kn 



nn 

=1 i=i 



2- ' 



27r 



■ exp 



1 + ( - 



2'^"(27rn 



,-(fc-l)2f- 



(detS^'^))^ 



1 + ( - 



By eq. (14. lip the determinant of B^'^^ can be explicitly evaluated: 

2P 



detB'^^^ 



k 



)fc-i 



(det5 



.k-l 



Substituting this expression into eq. (I4.12p gives then 

22 



(4.13) 



A;2 



p-2 



O 



To verify the above asymptotic formula we evaluated the ratio between the leading 
order term of (I4.13P and the exact value of obtained by numerical calculations of 
\Cn\. The results for p = 3 are presented on fig. As can be observed, this ratio 
is close to one for large n. 
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30 40 50 60 70 80 90 100 

n 



Figure 5. The upper five curves show the ratio of the exact moments 
Zfc, k = 2, ... 5 to the asymptotics ()4.13p as a function of n in the case 
p = 3. They are ordered with respect to k with the most upper curve 
corresponding to /c = 2. The dashed curve at the bottom of the plot is the 
ratio of the exact size of the largest cluster to its asymptotic value (|2.10p 
measured at the points n which are multiplies of eight. 



5. Distribution of cluster sizes 

We will use now the results of the previous section to evaluate the probability 
P{t) that a randomly chosen periodic orbit belongs to a cluster of the size t|Cmax|, 
t e [0, 1], where |Cmax| = niax„ \Cn\ is the size of the largest cluster. To this end we 
first need to establish the asymptotic behavior of |Cmax|- 

By eq. fl3.3l) the maximal size is given by: 

(5.1) |Cmax| = max ( n /" ^ ) ■ ^^P'^(<^)' 




'0 



i(n» + logTr(QA(</))). 



To evaluate the above expression we can once again use saddle point approximation. 
Taking the first derivative of J^{4>) leads to the following equation for the saddle 
point 

(5.2) ^^^^^-M/^WPAiQWr-') 



with Hi being i's component of the vector n. It can be expected that the cluster 
of the maximum size is provided by the most homogeneous vector n among all 
admissible vectors n. For the sake of simplicity take n, such that n mod 2^ = 0. 
In this case n = 2~^(1, !,...!) and the saddle point equation (15. 2p is satisfied when 
= 0. At this point J^{0) = nlog2. Since |v^(</>)| < nlog2 for any 0, it is clear 
that Cfi is, indeed, has (at least asymptotically) the largest size among all clusters. 
Repeating then the same calculations for the second derivative of J-'{4>), as in the 
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previous section, we obtain after saddle point approximation: 
(5.3) |C„,ax| = ^2 (n/2) (1 + (1/n)) , 

where Z2{n) is given by eq. fl4.7p . 

To calculate the probability P{t), it is useful to notice that the moments can 
be represented as integrals: 

(5-4) = ['dtp{t)t'-\ 



nn\n \k-i I 

^ I '—'max I JQ 

where p{t)At is the probability to find a random periodic orbit in a cluster C„ whose 
size belongs to the interval |Cmax| ■ t < \Cn\ < |Cmax| ■ (t + At). The probability 
to find a random periodic orbit in the cluster of the size smaller than [Cmaxl^ is 
therefore: 



Pit) 



It remains to find p(t). After substituting into eq. (15. 4 p the asymptotic expressions 
for Zk and ICmaxI "we obtain in the limit n — )■ oo: 



(5.5) 



dtp{t)t 



fe-i 



k 



-2P- 



By taking the Laplace transform on both sides of eq. (15. 5 p one has 

(logt) 



(5.6) 



Pit) 



(2P-2 _ 1)1 • 
In particular, for p = 2,3 this gives: 

(5.7) Pit)=t (forp = 2); P(t) = t(logt - 1) (for p = 3) . 

The comparison of the above result with the direct numerical simulation is shown 
on fig. ini 




Figure 6. The exact distribution of cluster sizes for n = 70 (upper 
green) and n = 47 (lower red) is shown in comparison with the asymptotic 
expression P{t) = t(\ogt — 1) (dashed blue line) in the case p = 2> 
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6. Anisotropy properties of clusters 

The method used in the previous sections to calculate moments can be also 
applied to obtain a more refined information on the distribution of periodic orbits in 
clusters with regard to each edge individually. One can ask for instance how many 
times a sequence a G Xp appears in a random sequence x of length n. As one can 
expect, if all orbits are weighted in the same way the result does not depend on a: 

(6.1) < ria >:= 2-"5^nJC^| = -2-"z9<^„Tr (QA(0))" |^=o = n/2P. 

n 

In other words, the periodic orbits are ergodically distributed over the graph Gp. 
However, if only periodic orbits from large clusters (or small clusters) are consid- 
ered the question regarding homogeneity of their distribution over the edges of the 
graph, does not seem to have a trivial answer. For instance it is clear that smallest 
clusters are dominated by periodic orbits with either large number of zeroes or ones. 
Therefore, among the periodic orbits belonging to small clusters there should be 
enhanced probability to meet a subsequence consisting of all zeroes (or ones). 

The purpose of the present section is to investigate the anisotropy properties of 
the graph with regard to distribution of periodic orbits belonging to clusters of 
different size. To this end let us consider the sum 

(6.2) < ria >2: 







2 






2 



As opposed to < Ua > the above quantity gives a larger weight to the periodic orbits 
from larger clusters. Thus, in principle, < Ua >2 might depend on a. To verify this, 
we observe that (16. 2p can be written as 



J^^^\cf>) = log [Re(-z9<^„Tr(Q(0))"Tr(Q(-0))")] 

= log [nRe((a|A((/))QoQ(0)""V)Tr(Q(-0))' 

where \a) is the state corresponding to the sequence (edge) a. We can now apply 
saddle point approximation to eq. (16. 3p . Expanding J-'n{4>) up to the second order 
in 0j's gives 



(6.4) 



Jl«)(0) = log(n2^"-^) - ^ (j2b,,Mj + 0{<f>t)^ 



with the matrix B given by eq. (14. 5p . Substituting then (16. 4p into (16. Sp leads to 
(6.5) < n„ >2= n/2P + 0(n°), 

where only the subleading term (possibly) depends on a. This form of < ria >2 indi- 
cates that to the leading order in n the periodic orbits are equidistributed uniformly 
over the graph. The asymmetry shows up only in the second and higher order terms 
of the asymptotic expansion. In fig. [7] we plotted < Ua >2 as a function of n for 
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different edges of tlie grapli. It is clearly visible that existing asymmetry between 
different edges, essentially, does not grow with n. 



-0. 
-0.15 
-0.2 

-0.25 

-4'. 



55 60 65 70 75 

n 





Figure 7. Distribution of periodic orbits over edges of the graph for 
p = 3. The upper figure shows the graph of < >2 as a function of n 
for three different edges ai = [Oil], 02 = [101] and 03 = [000]. The lower 
figures show the graphs (for the same edges) of na{t) for n = 82 (left) 
and n = 70 (right), respectively. Here the doted lines correspond to the 
constant value n/8. Note, that up to a shift in the vertical direction, these 
two graphs are close to each other. 



In an analogous way one can estimate averages: 

(6.6) < Ha >k--- 





\Cn 




y 1 


r 1 


k 



for an arbitrary k. As in the case of = 2, the leading order term of the asymptotic 
expansion of < >fc is equal to This can be interpreted to the extent 

that periodic orbits uniformly (to the leading order of n) pass through the edges 
of the graph Gp, independently of the size of the clusters they belong. The non- 
uniformity, however, does appear in the next order n° of the asymptotic expansion. 
To demonstrate this, we calculate numerically the averages: 



(6.7) 



Uait) :-- 



E 

\Cr,\<t\Cn 



E 

I Cn I ^ I Cm ax | 



Cr, 
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where the sum runs over clusters with sizes less than tjCmaxI, t G [0, 1]. It follows 
straightforwardly from the definition of na{t) and eq. (16.11) that 

J2na{t)=n forte [0,1], n,(l) = n/2P. 

In the case p = 3, the resulting plot of na{t) is shown on fig. ([7]) for n = 82 and 
n = 60, respectively. As one can see, there is enhanced probability for periodic 
orbits from small clusters t <^ 1 to pass through the edge a = [000] , and suppressed 
probability to pass through the edge [Oil]. On the other hand this picture depends 
very little on n. 

7. Discussion 

To summarize, the problem of counting p-close periodic orbits of the backer's map 
can be cast in an equivalent form of finding degeneracies in the length spectrum 
of the de Bruijn graph Gp. The latter problem has previously attracted attention 
in [13] , where an asymptotic expression for Z2 has been obtained for a generic graph. 
Related counting problems were also considered in [12] for fully connected graphs 
and in [13] for binary directed graphs. In comparison to [T3] we go somewhat further, 
as we derive asymptotics for all moments Zk and explicitly obtain the leading term 
prefactors depending on p. Our considerations have been restricted to a simplest 
possible symbolic dynamics which occurs in the backer's map (leading to a specific 
binary graph). As a matter of fact, the results can be straightforwardly generalized 
to the alphabets with a larger number of symbols. Moreover, the present approach 
can be extended to the symbolic dynamics with non-trivial grammar rules, when 
certain symbolic subsequences are forbidden. In that case the resulting graph Gp 
would typically have a non-homogeneous structure, where the number of outgoing 
and incoming edges depend on a specific vertex. 

The rescaled moments Zk can be interpreted as the probabilities Vk that k ran- 
domly chosen periodic sequences are p-close. Making use of information on Vk, we 
obtain the asymptotics of the probability P(t) to find a random periodic sequence 
in a cluster of the size smaller than t|Cmax|, t < 1, with C^ax being the largest clus- 
ter. Most significantly, in the large n limit, P{t) does not depend on the length of 
sequence, but only on p. It is worth noting that P{t) is basically determined by the 
distribution of "large" clusters, whose size is of the order |Cmax|- "Small" clusters 
Ci, whose size is \Ci\ ^ jCmaxI do not affect (asymptotically) the moments Zk and 
therefore, do not contribute to P{t). A natural question arises: what is the number 
of "large" clusters in comparison to the number of "small" clusters? As we show in 
Appendix C the total number of clusters N'p is proportional to n^^ . On the other 
hand, based on eqs. (15.31 14.13"|) a rough estimation of "large" clusters yields 

AAW ^ Zk/\C^..\' ~ n'^~\ 

This shows that the number of "large" clusters scales as AfP ~ In other 

words most of the clusters are "small" in the limit n — )• 00, although their influence 
on Zk is negligible. 
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We have also studied the distribution of the periodic orbits over the graph Gp. 
A priori, it might be expected that subsequences consisting of only zeroes (or ones) 
appear more often than others in periodic sequences belonging to "small" clusters. 
As we show, this is indeed so, but such inhomogeneities vanish asymptotically. To 
the leading order of n periodic orbits cover the graph Gp uniformly. In other words, in 
long sequences belonging to clusters of size comparable with |Cmax| each subsequence 
of p symbols occurs on average the same number of times. 

One of the main motivations for the present study comes from the theory of 
quantum chaos, where clusters of p-close orbits play an important role. It is worth 
mentioning, however, that the limit considered in the present paper, n — oo with 
a fixed p, is somewhat different from the usual semiclassical limit. In the last case 
spectral correlations are determined by pairs of long periodic orbits whose encounter 
lengths are proportional to the logarithm of their total lengths [6j . This corresponds 
to the limit of both n — )■ oo, p — )■ oo such that the ratio is fixed. In this limit one 
might expect completely different behavior for Vk- In particular the number of p- 
close pairs should grow faster than 2". The exact behavior of Vk in the semiclassical 
limit is of great interest and we leave it for future investigations. 
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Appendix A. Change of integration variables in ( 13.41 ) and ( 13.51 ). 

Here we demonstrate how the total number of integration variables in the inte- 
gral (13. 4p can be effectively reduced by half. To perform the change of variables we 
use the invariance of the trace under the multiplication of each matrix QK{cf)) by 
a diagonal matrix A(^) = diag {e*^^, . . . , e'^^p | an^l its conjugate from the left and 
from the right side correspondingly: 

Q'A(0) -> e-^«A(^)Q'A(0)A(C)^ 

Any such transformation with an arbitrary ^ = (.^i, . . ■S,2v-^) and ^ leaves the in- 
tegrand in (13. 4[ 13. 5p intact. Note also that this transformation does not affect the 
structure of the matrix (13.81) . but only its phases. Using this one can eliminate the 
phases (p'l = 02P-i+i5 • • • 02P-i = *^2P on the "right side" of the matrix Q' by imposing 
the system of linear equations on ^k, k = 1, . . . , 2^~^ 

(A.l) 02fc-i - 6fc-i + 6p-i+fc - ^ = 0; 

On the "left side" of the matrix Q' the above transformation induces new phases 

(A.2) if2k-i = <p2k-i - 6fc-i + 6 - ^; 

V^2fc = 02fc — ^2k + ^k ~ ^, 

where ^j's are determined by eq. (lA.ip . After fixing the common phase ^ to be 
a symmetric combination of other variables ^k, i-e. C = '^~^Ylk^k system of 
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equations flA.ltlA.2]) can be cast into the matrix form: 



(A.3) c^' = F^; 

(A.4) <p = 0-G|, 

where the matrices F and G are expressed in terms of Qq, Qi, and Q: 
(A.5) G = l-Qo + {Q/2Y; F = t-Qi + (Q/2)^ 



Combining now eq. flA.3l) with eq. flA.4l) we define the following linear transfor- 
mation 

cf>\_fl GF-' \fcp 
0' i - I 1 [cp' 



(A.6) 



which exists since the determinants of both F and G are different from zero (see 
proposition IB.ip . By construction, the resulting matrix e~*^A(^)(5'A(0)A(^)"'' has 



the required form fl3.10l) with the phases given by ip. Furthermore, since the Jacobian 
of the linear transformation flA.6p equals one, the variables cp' do not enter the 
integrand at all and can be integrated out. In the case of higher moments Zk, k > 2 
one has to perform the similar transform for each set of variables (p^^^ The resulting 
expression for the integral is given then by the equations (13.91) and (13.101) . 

Appendix B. Calculation of the determinant of the matrix (14.51) 

In this appendix we calculate the determinant of the matrix B. We recall that B 
is defined using 2^ x 2'p matrices Qq, Qi, their sum Q = Qo + Qi and Qp = 
see (14. 5p . For the sake of completeness, we give their element- wise definition: 

• The entries of the matrices Q and Qq, Qi are given by = 1, . . . 2^^): 

2P-1 

(B.l) [Q]i j = ^(5i,2fc-l + Si,2k){Sj,k + 5j,2P-l+fc) 

k=l 

2P-1 2P~i 

(B.2) [Qo]i j = ^^(5i,2fc-l + 5i,2k)5j,k'-, [Ql]ij = ^^('^j,2fc-l + 5i,2fc)5j,2P-i+fc- 

fc=l k=l 

• All elements of matrix Qp are equal to 2~^. 

Bellow we collect a number of matrix relations which are of use in the further 
analysis. They follow directly from the matrix definitions. For any natural k: 

(B.3) Q Qp = QpQ = 2 Qp] QoQp ~ QiQp ~ Qp] Qp ~ Qp 



:p} 



(B.4) Q^Qo + QlQi = 2-l. 

For the traces of various matrix products we have: 
(B.5) TrQpQl, = TiQ^, = 1, TtQ' 

where (and further) (5o,i stands either for Qq or Qi. 
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Two more, useful equalities can be derived by using fIB.Sp 

(B.6) {QoA-Qp)" = {Qo,i-Qp)Ql:i\ 

(B.7) {Q-Qpf = g'^ - (2'= - i)gp. 

Both can be proved by induction. To derive (IB.6P it is enough to notice that 
QpiQoA — Qp) = 0. To obtain the coefficient at Qp in eq. flB.71) we assume that 
{Q — Qp)^ = — a^Qp with some yet unknown a^. Then the next iteration gives 
the following recurrence relation for a^: a,t+i = 2^^ + a^.. Its solution is = 2^^ — 1 
as it is indicated in f lB.Tp . 

Having described the properties of the matrices Qo,i, Q and Qp we come to cal- 
culation of their determinants. The ffist axillary statement is: 

Proposition B.l. For any real a 

(B.8) det [1 - «(go,i - Qp)] = 1, det [1 - a{Q - Qp)] = 1 - a. 



Proof. Consider the logarithm of the determinant: 

oo ^ 

logdet[l - a(go,i - Qp)] = -J^ ^^^^^ " ^p)'- 

k=l 

Applying now the formula (]B.6p we come to the matrix (go — Qp)Qq which trace is 
zero for all /c > due to the relations (IB. 51) . This proves the ffist statement. In the 
same way, using the equation (]B.7p and the information about traces we derive 

oo ^ 

log det [1 - a{Q - Qp)] = - ^ ^IV (g - Qp)" = log(l - a). 

k=l 

□ 

We can use the above proposition for a = 1 to obtain the determinants of the 
matrices F and G defined in (lA.SP : 

Corollary B.2. 

det F = det G = 1 



In the remaining part of the appendix the determinant of 2^ x 2^ matrix f l4.5p 

B = 2--^ (^21 - (1 + 2p)g, + go £ (I) ' + £ (^) ' Q^j , 

is evaluated. The following remark helps to perform the calculations. 

Remark B.3. In the saddle-point calculations of the integral Ii3.9\) made after the 
change of variables one comes to the Gaussian integral ( (^.6| j involving matrix B . 
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However, change of the order leads to another matrix, 2 ^Mp+i 
dimensionality and connected to the matrix B by the relation 



It has twice larger 



(B.9) 2-^Mp+i 



1 





1 



B 




1 



B 



BGF-^ 

T - 



-{GF~^y B {GF-^yBGF-^ 



Explicit form of matrix Mp+i is 



(B.IO) 



M 



p+i 



1 



E 

r=l I- 



+ 



2 



1) 



where all matrices are of the size 7P^^ x 7F^^ . 

Eq. (IB.9P allows to cast the problem of calculation of det B into a "symmetric" 
form. Namely, by the following proposition the determinant of B can be expressed 
through the eigenvalues of two matrices which depend only on Q and . 

Proposition B.4. The determinant of the matrix B can he represented as 



(B.ll) 



det B = 1 



-2P 



det M„ ' 



where Xj are non-zero eigenvalues of 2^+^ x 2^^^ matrix Mj 
matrix Mp := 1 + Mp + 3Qp. 

Proof. Consider the spectral problem for the matrix Mp+i 



p+i and Mp is 2^ x 2^ 



P+i 



X 
Y 



X 



X 
Y 



BX - BGY = XX 
-G^BX + G^BGY 



XY. 



where G = GF ^. The latter system reduces to the equation 



X{G^X + Y) 



0, 



meaning that either A = or —G^X = Y. Backward substitution results in the 
following eigenvalue problem 

B{1 + GG^)X = XX 
where A's are non-zero eigenvalues of the matrix M, so that we can write down 



detS 



det{l + GG^) detiF^F + GTG)' 

In the last expression the result of the proposition IB.ll has been used. 

One can proceed further by evolving the denominator. First we observe that it 
depends only on the matrix Q. Indeed, due to the properties fIB.Sp . flB.4p one has 



F^F + G^G = A-1-Q-Q^ + 2Qr 



COUNTING OF SIEBER-RICHTER PAIRS 



23 



Expansion of each of the fractions with the help of equation flB.7p 



2^2 fc=0 fc=0 ^ 

yields 

(B.12) F^F + G^G = 2 (^1 - I + (1 + M, + SQ,) (l-^ + 9l. 

where all matrices are of the size 2^ x 2^. By taking into account (IB.SP we arrive to 
the final expression for the determinant of S. □ 

It remains now to calculate the spectrum of matrices Mp+i and Mp. Note that 
both matrices have a similar structure and can be treated in the same way. Their 
spectra is provided by the following lemma. 

Lemma B.5. The spectrum of the matrix Mp^i contains 2^ (half of the total num- 
ber) zero eigenvalues, the non-zero part of the spectrum has the following structure: 

Afc = + 1 with multiplicity 2^^'^^^, < k < p — 1, 

and Ap = p + 1 has multiplicity one. The non-trivial eigenvalues of Mp are given by: 

Xk = k + 2 with multiplicity 2^"''"^, < k <p - 2, 

and Xp-i = p+1, Ap = 4 having multiplicity one. The remaining 2^"^ — 1 eigenvalues 
are all equal to one. 

Proof. To prove this lemma it is convenient to work within the tensorial represen- 
tation of matrices. According to fl3.ll) the matrix is given by 

(B.13) = -T^ 

\ r times / 

where T is the following shift operator: 

T\ai) (g) \a2) ■ ■ ■ ® \ap) = \a2) ® • • ■ ® \ap) ® |ai), 

and the projection s acts on |0), |1) as 

s|0) = 0; s|l) = |l). 

Note that the result of the action of the operator on the basis vectors ® 
IJ2) ® ■ ■ ■ ® \ jp), jk £ {0, 1} essentially depends on the number of consecutive "ones" 
at the end of the sequence jij2 ■ ■ ■ jp- Whenever this number is larger than r, the 
result of the action of is zero. This property allows to find all eigenvalues of the 
operator Mp+i. 

Let x^''^ be a vector having exactly k "ones" at the right end of the encoding 
sequence, separated by two "zeroes" (if k < p) from the rest of the sequence , i.e., 

^i^) = |0)®a;p_fe_i(g)|0)(g)|l)(g)---® |1), 

^ V ' 

k times 
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where Up_k-i = ® \j2) (S) ■ • • (8) \jp-k-i) is an arbitrary product vector of the 
length p — k — 1. By the definition the vector consisting of only "ones" . It 

is easy to see that x^^^^^ is an eigenvector of Mp+i with zero eigenvalue. All other 
eigenvectors of Mp+i can be constructed as linear combinations of the rotations of 



k 

(B.14) ^a.T^X^'^. 

j=0 

The action of Mp+i on each of these combinations {k < p) results in 



i=0 \i=0 / \j'=0 




Equating the right hand side of this expression with A Ylj=o ^j'^^X^^^ we rewrite the 
original problem as the eigenvalue problem for the k x k matrix consisting of all 
ones: 

k 

aj = Aftfc , k = 0,l,...,p. 

j=0 

The solution is well known - all eigenvalues are zeros except one which is equal to 
k + 1. The degeneracy of the eigenvalues is defined by the free part Up-k-i of the 
vector x^''^ and, therefore, equals to 2^~*^~^ for < k < p — 1 and to one for k = p. 
The total number of non-zero eigenvalues is 

p-i 

k=0 

The eigenvalues of Mp can be obtained by a general shift of all eigenvalues of Mp 
by 1 with the only exception of zero eigenvalue corresponding to x^^^ which should 
be shifted by 4. □ 

By Lemma IB.5I one has the following chain of identities 

p-l p-2 

(B.15) n =(p+^) + 1)'""' = (p + 1) + 2)'"'" = 4 ^et k. 

AjT^O fc=0 k=0 

Substituting this into (IB. lip gives the final expression for the determinant of B: 
Proposition B.6. 
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Appendix C. Calculation of the total number of clusters 

Here we estimate the total number Mp{n) of equivalence classes Cn of all sequences 
of the total length n generated by the equivalence relation x ^ y {x,y & Xn). 
Recall that every equivalence class is uniquely parametrized by a vector of integers 
^ = {^a,o € ^p}- The elements of this vector determine the number of times a 
periodic orbit from Cn passes trough the corresponding edge of the graph Gp. Since 
vector n corresponds to a real periodic orbit of the length n, its components must 
satisfy the following constraints: 

i. The total length of the trajectory is fixed: 
(C.l) J] n„ = n. 

ii. The number of times a periodic orbit enters a vertex of the graph Gp must be 
equal to the number of exits from the same vertex. This balancing condition 
is represented by the equation 

(C.2) Sn = n^R. 

It is easy to see that for any vector n satisfying the above conditions, with 7^ 
for all a, it is possible to find a closed path on the graph which passes through an 
edge a exactly times. Note also that the number of solutions with = for 
some a is smaller by a factor 1/n than the total number of solutions of (]C.l|IC.2p . 
see [15]. Therefore, to find the leading asymptotics of Mpin) it is sufficient to count 
vectors of positive integers satisfying the equations (lC.lllC.2p . 

Since the system (lC.2p is composed of 2^~^ — 1 linearly independent conditions, 
we can chose 2^~^ first elements of n freely, while the rest is then uniquely fixed 
by eqs. (lC.l|C.2p . In addition, the constraints > 0, i = 1, . . . 2^ must be sat- 
isfied. These constraints define a 2^~^-polytope Vp in the 2P~^-dimensional space 
of ni, . . .n2P-i. Geometrically the number of clusters Npin) can be interpreted as 
the total number of points with integer coordinates encompassed by Vp. Accord- 
ingly, the leading term of Mp{n) in the large-ra limit is given by the volume of Vp. 
Therefore, 

(C.3) Up{n) = Wpn^"~\l + 0{l/n)), 

where the coefficient Wp can be calculated explicitly for low values of p. We illustrate 
this by the following example. 

Example: For p = 2 the conditions (]C.lfC.2p take the form: 

n2 = ns, rii + n2 + + = n. 

We chose two independent integer be ni = k, n2 = m. Since nj > for all i = 
1, 2, 3, 4, the problem is reduced to calculation of the area of the triangle 

V2 = {x>0,y>0,n-2x-y>0}, 

in the (x, ?/)-plane. As a result, for p = 2 we obtain A/2(n) = -|- 0{n). 

It worth mentioning that similar problem for non-directed graphs was considered 
in [15] . It was shown that the number of equivalence classes (equiv. the degeneracy 
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classes in the length spectrum of the graph) in the leading order of n is proportional 
to n'^l"^, where \E\ is the total number of edges in the graph. For comparison 
with fIC.Sp . note that the number of edges in Gp is 2^. This reflects the fact that 
the number of equivalence classes in directed graphs is essentially smaller than in 
non-directed graphs. 
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